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A 3 ST ACT 

The focus of a study (a f o 1 1 o w ' u p c o work h y 
Rosenthal and others which has indicated that teacher exp<=ctaRcies 
are fulfilled in ctuaont performance) wa.s the !;ino of teacher 
behavior which might eventuate following the experimental 
manipulation of an expectancy which could affect the performance of 
students. Subjects vero re male u r dorgr i dua r e teachers ^Tr>) h = 26) and 
randomly selected sixth and seventh gradois ’Ss) (N-104) from a 
iTidwestGMi school. Fach T was given s lesson plan designed to 
maxitaize disoussica and ly scores (randomly assigned and bearing nc 
relation to Ss^ actual ability) for each of th^ four stuients in her 
uO-minute mic t oteachi ng class. Using a specially developed 
observatio.i instrument, observers categorized bei’avioi in terms of 
six types of teacher reaction to student statements- Otlior data was 
obtained using the Rokeach Dogmatism Scale (Form F) tc assess Ts * 
authoritarianism and the questionnaire to chcch the credibility of 
the exporimental rra ni pu la t ions and to obtain information about Ts* 
perceptions of Ss* behavior* Two-way analyses of vaiiance with 
repeated measures were eraploye’ in analyzing data. Support was found 
for the so-called ’‘interaction quality hypothesis*" Ts did not differ 
in the amount qf attention divert to allegedly "gifted" and 
"nongifted" Ss, lut the pattern of attention and praise did differ. 

An attempt tc relate diflerential teacher ^’'ehavior to dogmatism war 
unsuccessful. (JS) 
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Toi^at d an Lxplanation o f t he Rosenthal -Jacohsoii Fi ndin^s 



Paniela C. Rr.bcvitJ? and Martin L, Maehv 
University of Illlrois at Urbana-Champaign 



PYGMALION ANALYZED: 




CATfON POS<IlONOR POUCV 



In their highly publicized book, P ygcnalivn in the Cla s sroom , Rosenthal and 
Jacobson (1968) report the astounding finding that erperiinentally created teacher 
expectations resulted in changed pcrfor-nance on the par/ of students. At the 
beginning of the school year teachers wore told to expect intellectual growth from 
certain of their students. Even though these potential ‘'bloomere'' haJ been randomly 
;»electei from the cUis^, they nevertheless fulfilled the prophesy, showing greater 
I'J gains over the ccur?e ot a year than did a group of ^^oivtrol students. 

Such intriguing results do net easily go unnoticed. Predictably cuough, they 
also do not go unchallenged. Thus mvr.^cious studies and rcviev;s have reconsidered the 
"pyg^^lior^ phenomenon" with nixed results and varied conclusions. (Claiborne, 1969; 
Melchenbaun e t al . . 1969; Minor, 1970; Snow, 1969; Ihornd'.kc, 1963; Thorndike, l9o9.) 
But whether or not the pygnal^on phenomenon is teal or Illusory it is of inportanre 
to CA^tcmlnc v;hethor and/or how expectancies aff'/cl tccohc"’ behavior. Most certainly, 
if it ic finally detornlned that teachers* cxpectutlcns do indeed eventuate in 
rcliahfct changoi in student pcrfoTnancc. one -’ould liave to inquire more specifically 
into the ii: cervonlng teacher • student inte/actiens which directly affect studc.nt 
behavior. Such the task of the pr^'sent study. More specifically, the goal is to 
test cne hypothesis regarding the effects of teacher expectations on teacher-student 
Interaction, Ibis hypothesis aS ( Rosenthal's work and can be referred to 

as the **interac tlon quality" hypothesis. 

Rosenthal and Jacobson only speculate as to "how teachert brought about intel- 
lectual competence simply bv expecc/!>g J t" (Rosenthal and Jacobson, 1968). Their 
data indicate that there was no . iffcrence In the amount of tine teachers spent 
with the students In the two g/oups, thus suggesting that 




it nav havt bee a a matter of the cype of InJeracticn 

which took place between the teachers and their pupils which 
served as th o r’ctcrnlrant of the expected JntePcctual gains 
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7.his ‘'interaction quality" hypotnesis has appeared before, as the preteired 
expl«"inatiou in the research on the effect of experimenter bias (Rosenthal, 1966). 

In some of these experimenter bias studies Es actually admit that they behavec or 
felt differently toward their "bright" and "less than bright" Ss (Rosenthal, 1966; 
Rosenthal and Fode, 1963; Rosenthal and Lawson, 1964). 

The interaction quality hypothesis, however, xs still only speculative. The 
studies on experimenter bias did not Identify specific £-£ interaction. The m*.»re 
recent classroom study (Rosenthal and Jacobson, 1968) also did nor investigate just 
what teacher bchavioi^ could be influencing student performance. In an unsuccessful 
attempt at replication of this study, Claiborne (1969) a».parently made an attempt to 
consider teacher-student interactions, but he provides no J^ra on this variable, 
IJelchcnbaum ct al . , (1969) In a scvdy which confirmed the Rosenthal- Jacobson 
findings, specifically and systematically considered teacher-student interectionj^ , 
Employing chc very global categories of "poi^vtlve," ‘'negative," and "neutral" they 
found some basis for concluding that expectations influenced the quality of teacher- 
student interaction, but in a cempLex anJ not cacily intei pretablo manner. Two 
teachers increased their positive interaction with "special" students; one teacher 
decreased her negative interactions with "special" students. T*ius expectations 
di f ferenti ally affected the quality of the teact-iors* interaction x^ith the expectancy 
students. Since the Meichenbaum et al ,, study involved only four teachers, their 
ml>ed results from very general categories cannot be tal:en as definitive support Cor 
the interaction quality hypothesis, t^at is needed is more detailed and extended 
consideration of teacher- student interaction following the manlpulatiorv of teacher 
expectations. 

One goal of the present r'»search, then, was to find out whether or not varying 
teacher expect'xtions eventuated in differential teacher- student interactions. 

Another question was also considered. Assuming that giving teachers an expectancy 
affects their behavior. It Is doubtful that al \ teachers are similarly affected, 

1^0 decades of research on authoritarianism (Klrscht and Dlllehay, 1967) would 

,€ft that degree of authoi i tari an! sn may wcl) mediate an iridividual ’ s confoinity 
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to an Imposed set or expectancy. Accordluglyi it vas hypothesized that individuals 
high in this trait Vtiould be more susceptible to an expectancy influence than those 
low in authcritarianism. 

Method 

Subject j 

Two different groups participated In this study. One grou^#, referred to as 
teachers (Ta), was composed of 26 female undergraduates enrolled In a child 
development course. All Is expressed interest in teaching as a career, but qodc 
had yet done her atjdeut teA<:hing. Although Ts were volunteers, their instructor 
did allow them to participate in this study In licju of a term project. It must 
be emphasized that Jb knew nothing of the experimental manipulations; they thought 
chey were participating in a pilot mlcrotpachlng project being conducted to give 
tiem additional teaching experience. 

The other group participating In the study Is referred to as students {£s). 

It included 104 alAth and seventh graders from a middle school In a small nidwestern 
city. These S: were randomly selected and given no instructions about hov they 
were to behave. 
heya u rcaent procedures 

A major problem of the present study was to find a procafure for effectively 
analyzing the quality of teacher-student Interaction relative to the purposes of 
the research. Host measures of te*xhcr-student interaction (Flanders, 1960; 

Perkins, 1964) or inter personal behavior In groups (Veick, 19Cf^) are not readily 
adaptable to microteachlng sltuatlcna. Moreover, the rather general analysis 
which these rating syatems typically provide would not reveal the validity of the 
Ir.toractlon quality hypothesis. A special Instrument was therefore developed. 
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Thle instcuaent requires a trained observer to record the incidence of six 
different teacher behaviors • These six Indexes are: (1) teacher attention to 

students' atatementa^ subdivided into attention to requested stateoients and 
attention to spontaneous student statements; (2) teacher encour a^ement of students' 
statements; (3) teacher elaboration of students' statements; (A) teacher iRnorlnK 
of students' statements; (5) teacher praise of students' statements; and (6) toacher 
criticism of students' statements. A detailed description of this observational 
procedure may be found in Rubovits (1970). It should be added that this instrument 
appears to provide sufficiently reliable assessment of teacher behavior. In a 
preliminary check before proceeding vith the study, dual raters were employed and 
interrater agreement was found to be 90 percent or above in all categories. 

In addition to assessing transactions in the microteaching situatioui 
personality and questionnaire data were obtained. An attempt w/is made to assess 
Ts' authoritarianism with the Rokeach Dogmatism ^cale, Fovea E (Rokeach, 1960). 

A questionnaire was used to cncck on the credibility of the experimental ti^&nipulation 
and to obtain information about Te' perceptions of Ss' behavior. 

Experioental procedur e 

Approximately one week prior to teaching, each T was <;iven k lesson plan which 
outlined the topic to be taught and which suggested points that could be covered. 

The topic of television was eelected as one which would readily elicit the partici* 
pation of sixth and seventh graders and would allow for a conaidarab'e amount of 
teacher-student interaction. The lesson plan was designed to maxlmlxc discjssion 
and also to allow Ts considerable freedom in adding and emitting points as well as 
in creating new e^iarsples. The Ts w^re nanibdad, however, to keep in mind the 
two objectives of the lesjon: (1) to gather general information about these Ss' 

viewing habits and (2) to get suggestions about possible class projects for a 
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unlt on television. Along with the lesson plan Is vere also given a brief 
description of Ss they would be rneeting. Principally, they were told that they 
would be teaching a group heterogeneous in terms of ability, since this "most 
closely approximates the typical classroom situation." 

For each teaching session there was a different group of four Ss. The Ss 
vere brought to an unused classroom where the study was conducted and where T 
was waiting. Before Ss arrived T was given a seating chart which not only contained 
each S's first name but also an IQ score and a label indicating whether or not 
he had been selected from the school's gifted program or from a "regular" track. 

The iQ scores anO labels had been randomly assigned to Ss and bore no relation 
to Ss' actual ability. It was hypothesized that Ts formed different sets or 
explications for those Ss with big!. IQs and the label gifted than they did for 
those Sn with lower IQ's and the label nongifted. 

Vfhen given the seating chart, T was told to famlliariza herself with the 
names and to examine clcjely Che scores and labels above each name. When Ss 
arrived, T asked each S Co ait in the seat designated on the chart. This 
procedure allowed T to identify each S by namSi As abe aeated each S, T was told 
to look directly at each £ and to read again to herself the IQ score and the 
label fOi* that pay ^culer S, in order to become as thoroughly acquainted with the 
child's competence as possible. The T then iutrcduced herself, explaining that 
she had come from the University of Illlnola to try cut ao*je new teaching materials 
with them, 

Keanvhile, an observer (g) seated herself two rows behind Ss and began 
categorising T'a behavior aa.aoott as T had aaeted £a and introduced herself. 

The 0 categorized behavior for 40 laimitei although the teaching lession itself 
lasted for 43 to 60 minutes. It should be stressed that 0 was not aware of which 
Sa had been labeled gifted and which two nengifbed. 

.) 
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After |he teaching sesalonp 0 and T dlscussrd vhat had taken place, \^lth 0 
being carefad not to reveal the random aaslgnicent of IQ scores and labels ■ Each 
T was then administered the postexperimental questionnaire as well as the Dogmatlam 
Scale, A month later Ta were given a thorough explanation of the study; the 
nature of the experimental manipulations was described and the possible implications 
of the results were discussed. 

Results 

Interaction! analysis 

The observation of Ts was conceptualized as a measurement operation. The 
frequency counts collected on each T were considered to be the seme 5 ^ l^«t 
scores, chus ellowiiig for the combining of scoresi Each T met with two different 
kinds of Ss, '^gifted*' and '‘nongifCcd*'; each kind of S was viewed as a test. Every 
T made a s^ore on each test, i,e., for every T a certain number of observations 
ere recorded :or her interaction with "gifted'* S^s a certain number for her 
interaction with "nongifted" ^s. These scores were treated as repeated measures 
of the srnno individual. The individual observations can properly be considered 
as ordinal data, just as responses to individual (dichotomously scored) test items 
are ordinal measures. In the sane way that we treat the sums of individual test 
item responses (total scores) as if they were interval data, so can we proceed 
with the frequency observations of the interactions as if they were Interval data. 

In the present case, two*vay analyses of variance with rapeated measures 
(Winer, 1962) «.cte employed. Dogmatism, high and low on the basis of a median 
split, was the nonreplicated main effect; ''gi£ted"-"nongl£ted" was the replicated 
sain effect, A separate analysis was conducted for each dependent measure of 
teacher interaction with "gifted*' and "nongifted** Ss, 

O 
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Table 1 presents the mean number of teacher responses recorded in each category 
vjith "gifted" and "nongifted" Ss. Table 2 presents the results of the separate 
ANOVA^s. 



First consider the differences teacher Interaction vlth "gifted" and 
"nongifted" Ss, or the differences due to latcl* Only for tvo of the categories are 
there statistically signlficar.t differences In teacher Interaction with "gifted" and 
"nongifted" fs. 

Across all Ts, significantly wore statements were r equested of "gifted" Ss 
than of "nongifted" Sc (category 13). Also aevoss all Ts the atatcruerits of "gifted" 

Ss were praised slgniilcantly more times than v/crc the statements of "nongifted" Ss 
(category 5). Significant dif f'^rences for these two catcg^;rics had oeen predicted. 

It also had been (xpseted that there \;ould be jto significant difference In the total 
amount of attention paid to "gifted" and ’nongifted" Ss. Implicit in the interaction 
quality hypothesis Ic the assumption that the amount of interaction remains constant 
across Ss. It Is, presumably, the variation In the quality of that interaction that 
is critical. Since the difference due to labeling tor category 1 (total attention) 
was not significant, It appears that In accord with the Interaction quality hypothesis 
the amoun t of teacher-student interaction was not warhedly affected by the expectancy 
conditions. What was predicted, and substantiated , v;as the hypothesis that there 
would be significant differences in the amount of teacher initiated interaction wlrh 
"gifted' find "nongifted" Ss. Conversely it was expected that there would be no 
significant differences in category lA (attention to unsolicited statements). In 
effect this category provided a measure of the amount of studen t, initiated interaction. 
Wo significant differences in this category allows for the inference that there was 
little difference in the spontaneity of vcrbaslty of Ss. Gltted Ss were not, there* 
fore, called upon more because they volunteered less than "nongifted" Ss, The 
O ferential recognition noted in category IB apparently is attributable to the 



(Tables 1 and 2 about here.) 
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Fcr tSic other four categories significant cliffti. cnees due to label had been 

predicted, but none of these hypotheses was supported. The Ts showed no difference 

in the amount of criticism of "gifted*' and "nongifted" Ss. They did not ignore the 

statenents of one type of S more thai ^:he statements of the other type. Finally, Ts 

did net differentially encourage or elaborate upon the statements of one type of S, 

Consider new the effect of Icve), of dcgmatlstn on teacher interaction. There 

were no significant differences in the behavior of Ts high in dogmatism and Ts low 

in dogmatism as measured by these cfd.cgories. It had been predicted that tlicrc would 

be a significant Interaction between the effect of labeling and the level of dogmatism. 

This prediction was not supported for any of the categories, It, therefore, must be 

concluded that the behavior of these 26 Ts did not indicate in any way that highly 

duginatlc teachers are more affected by the labeling than arc tcacliers lower in dogmatiem. 
Credibility of the experimental sltuitic n 

Through a postexperiment inteivicw and questionnaire, an extensive check was 

made on whether or n>t Ts accepted the experimental situation as it v^as presented to 

them. Although they were given ample opportunity to do so, no T expressed any 

suspicion of the hypothesis being tested. Fui thermorc , when Ts were asked to give 

thel^ own personal evaluations of the students, they expressed narked agrcecneut with 

the randomly assigned labels. Fifty-two students had been labeled "gifted" and 32 

"nonglftcd," but only in the case of eight of the "gifted" Ss and three of the 

"nongl .ted" Ss Old Ts express any reservations or dlsagrcrment with the labels. 

These data, as well as clinical observations obtained In the postexperiement inter* 

view, suggest overwhelmingly that Tf not only accepted the situation as planned bi*t 

also interpreted the behavior of Ss through the lens of a label. 

Olscusslon 

This study was conducted not to replicate the expectancy uffect, but to make 
a start at explaining it. ?n this regard the results provide support for the 
^ Interaction quality hypothesis which was suggested by Rosenthal and Jacobson (1968) 
was supported to some extent by Melchenbaum ei al . (1969). 
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In thfc present study, ^'gifted'' Se were called upon more and were praised 
more than *^nonglfted” £s» It is, of course, possible to speculate that Ts In 
this study, being inexperienced, may hsve been scared of bright students and lo 
were oversolicitous of them, se^^king out their opinions and being sure to 
praise them often. Whether or not the came results would be obtained with 
experienced teachers can only Le answered by attempting to replicate these 
results with experienced teachers However, Melchenbaum et al. (1969) Included 
one teacher, experienced, Who responded to expectancy students by increasing 
her positive interaction with them. 

It is, of course, interesting to speculate as to hov calling upon and 
praising expectancy students more could l<^ad to improved academic perfor*qance 
by these students, if in fact teachers in othci studies that found an expecta^^cy 
effect (Rosenthal and Jacobson, 1968; Mcichonbaun et al , , 1969) behave J as Ts 
in this study did. Befng given more of an opportunity to participate in the 
class could cause expectancy students to clarify their thoughta more through 
dialogue with the teacher and to demonstrate their proficiency more frequently. 
Receiving more praise has far-rcachlng Implications for improving the students* 
motivation and learning. From the present study, however, nothing can be 
concluded about S performance, given these oifferenccs in T performance. More 
research is needad to replicate this study's findings and to investigate the 
effects of differences in teacher behavior on student performance* 

It had been expected that differences between interaction with '^gifted" and 
'tiongifted*’ Sa would also appear in the categories of ignoring, elaboration, 
encouragement, and ctiticisa. That such did not occur nay be an experimental 
artifact. If the lesson had allowed for leas student diaevasion and demanded more 
clear*cut exhibitions of competence, such differences might have been found. Quite 
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possibly the teacher behaviors of ignoring, elaborating, and encouragxng deserve 
further consideration in different teaching situations. As for the category of 
teacher criticism, it may v;ell be that with the existence of strong cultural 
proscriptions regarding negative treatment of students, variations will not manifest 
themselves in the amount of criticisra of ’^gifted” and ^’nongifted” Ss. 

That dogmatism did not have the predicted effect on Ts' behavior is a dis- 
appointing, but not altogether surprising fi*»ding. After all the n was small (26), 
the sample relatively homc,.eneous , and the median split did not yield extremes of 
any real magnitude. The matter, however, ought to be pursued further. 

All in all, the study has provided important evidence to the effect that 
teacher expectatl br.s do affect teacher behavior. Moreover, it appears that teacher 
behavior is affected in such a way that it is not unlikely that student performanc/5 
would be influenced in the manner reported by Rosenthal and Jacobson (1968). However, 
3 Y/ord of caution is in order inasmuch as the originally predicted differences did 
net ojcur in four out of six cases. As indicated previously, there seem to be valid 
reasons in each case, why these expectations were not confirmed. Nevertheless one 
might wonder whether the two confirmed predictiois may in the overall scheme of things 
be best attributed to chance in spite of the high significance levels fovind in each 
instance. Certainly statistical theory does not necessarily lead to that conclusion. 
The probability of obtaining two significant differences out of a total cf six 
predictions is less than .05 ass ming the truth of a null hypothesis. Be that as It 
may, further work on the problem is indicated and, indeed, is already in progress 
(Rubovits and Haehr, 1971) to lend further support to the hypothesis that expectations 
influence the quality of teacher-student interaction, 
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3. A seemingly more straight forward app,:oach might involve the use of difference 
scores os the dependen , measure. However, as lianning and DuBois (1962) point 
out, a major problem is inherent in the use of such scores, vir.., the relie.bility 
of difference scores tends to decrease as the correlation between the scores 
increases. Therefc;:e, it seems preferable v^here possible to avoid *.{\e use of 
sneh composite measures and thus M the present case a repeated measures 
design employing a ’’gifted'* and "nongifted" score separatefy seems preferable. 
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Tabic i. — Means and Standard DcAd.ations of Interactions with "Gifted" and 
"Kongifeed" Students by Teachers High and Low in Dogmatism 
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Table 2, •••Analysis of Variance of Labels and Dogmatism 



Source df MS F 



v A) ANOVA of Labels and Dogmatism In Category 1 (Total Atte .itlon) 



Between 



Dogmatism (D) 


1 


78,77 


.16 


Sc within 


24 


476.81 




within 








Labels (L) 


1 


232.69 


1.13 


D j; L 


1 


60,31 


.30 


L Ss within 


24 


205.29 






(B) ANO^'A of Labels 


and Dogmatism In Category lA 






(Attention to ! 


Unsoll<lted Statements) 




Between 








Dogmatism (D) 


1 


7.69 


.02 


Ss within 


24 


318,04 




Within 








Labels (L) 


1 


24.92 


*12 


D X L 


1 


44.31 


.21 


L X Ss within 


24 


204,99 






(C) ANOVA of Labels 


and Doj^atlsm la Category IB 






(Attent’ on to 


Requested Statetnents) 




Between 








Doginatlim (D) 


1 


11.08 


.12 


Ss within 


24 


93,67 




Within 








Labels (L) 


1 


529.03 


17.33* 


D X L 


1 


15.17 


.49 


L X Ss within 


24 


30.58 





(D) AKOVA of Labels and Dogmattsa In Category ? (Enccurajgcm^nt) 



Between 

Dog^BAtism (D) 
Ss within 

Wxclilfi 

Labels (L) 

D X t. 

D X Ss within 



1 

24 



16.17 

98.12 



3.25 

12.33 

24.99 



.07 



.13 

.49 





1 

1 

24 



Table 2. --Analysis of Variance and Labels and Dogmatism (cont.) 



Source 


df 


MS 


F 



(E) ANOVA of Labels and Dogmatism in Categr /y 3 (Elabovat ion) 



Betveen 


Dogmatism (D) 


1 


5.56 


.04 


Ss within 


24 


J21.31 




Within 


Label (L) 


1 


3.25 


.06 


D X L 


1 


10.17 


.18 


L X Ss within 


24 


66.34 





(F) ANOVA of Labels and Dogmatism in Ka tegory A ( Ignoring) 



Between 








Dogmatism (D) 




15.85 


.48 


Ss within 


• 24 


33.16 




Within 








Label (L) 


1 


.69 


.07 


^ X L 


1 


14,16 


1.44 


L X Ss within 


24 


9,84 




(C) A^^0VA of 


Labels and 


OoKmatisir. in Category 5 (Praise) 




Between 








Dogmatism (D) 


& 


11.17 


.74 


Ss within 


24 


15.20 




Within 




1 




Ubdl (L) 


1 


167.33 


5.40* 


D X L 


1 


4.dl 


.15 


L X Ss within 


24 


30.94 




01) ANOVA of 


Labels and 


Dc. 'atism in Category 6 (Criticism) 




Between 








Dogmatism (D) 


1 


.08 


.01 


Ss within 


24 


6.67 




Within 








Label (L) 


1 


1.92 


.03 


D X L 


1 


.69 


.10 


L X Ss within 


24 


6.97 













p < .01 
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